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SPECIFICATION 
THERMOPHILIC ENZYMES HAVING jS -GLYCOSIDASE ACTIVITY 

BACKGROUND OF THE INVENTION 

The present invention relates to a thermophilic enzyme 
having ^ - glycosidase activity. More particularly, the 
present invention relates to a thermophilic enzyme having 
P -glycosidase activity derived from a hyperthermophil ic 
bacterium belonging to the genus Pyrococcus. 

0 - Glycosidases are useful for hydrolysis of 
saccharides, DNA sequencing, conformational analysis of 
glycoproteins and glycolipids, and enzymatic synthesis of 
oligosaccharides and he terosaccharides with high optical 
purities. The catalytic reaction of /3 -glycosidases with 
substrates is specific with respect to the types of the 
monosaccharides constituting the substrates, and the 
optical isomerism and the position of the glycosidic 
linkage to be cleaved in the substrates. 0 -Glycosidases 
are also useful for the modification of sugar chains and 
the synthesis of oligosaccharides and polysaccharides 
retaining their optical stereoisomerism, as well as the 
synthesis of heterosaccharides (e.g., biosurf ac tants ) due 
to their ability to transfer a glycoside group into a 
primary, secondary or tertiary alcohol . Hitherto, various 
types of 0 -glycosidases with different substrate- 
specificities have been found in bacteria and plants. 
However, since many of such 0 -glycosidases are derived 
from mesophilic organisms, they are poor in thermal 



resistance, and consequently are unsuitable for use in 
synthetic reactions under such extreme conditions that 
organic solvents are used simultaneously. 

If a thermophilic 0 -glycosidase active in organic 
solvents is found, this can be used as an biocatalyst to 
develop a new procedure for synthesizing a 
heterosaccharide with high optical purity. In this 
procedure, the reverse hydrolytic reaction (i.e., 
synthetic reaction) is utilized which predominately occurs 
in the presence of an organic solvent. Under the 
circumstances, a novel jS -glycosidase which is active under 
extreme conditions has been strongly demanded. 

SUMMARY OF THE INVENTION 

An object of the present invention is to provide a 
thermophilic enzyme with j8 -glycosidase activity. 

For solving the above-mentioned problems, the present 
inventors focused on hyper thermophi 1 ic bacteria capable 
of growing within the temperature range from 90 to 100° 
C. As a result, they have found a gene that is assumed to 
encode a protein having 0 -glycosidase activity from its 
nucleotide sequence. The inventors have succeeded in the 
production of an enzyme from the gene by introducing the 
gene into Escherichia coli cells to transform the cells 
and then producing the enzyme from the transf ormants , which 
enzyme was confirmed to be stable at high temperatures (90° 
C or higher) and to have $ -glycosidase activity. This 
success leads the accomplishment of the invention. 

That is, the present invention provides a thermophilic 
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enzyme having jS -glycosidase activity which comprises the 
amino acid sequence of SEQ ID NO: 2 in which one or a 
plurality of amino acid residues may be deleted, replaced 
or added. The number of the amino acid residue which may 
be deleted, replaced or added in the amino acid sequence 
of SEQ ID NO: 2 is not particularly limited as long as the 
/3 -glycosidase activity is retained, but preferably from 
1 to 30, and more preferably from 1 to 18. It is preferable 
to delete, replace or add an amino acid residue or residues 
present in any of the regions of amino acid residues 78 - 86, 
154-171 and 1-423. The enzyme preferably has an optimum 
temperature of lOO'' C or higher. 

The present invention also provides a DNA which is 
capable of hybridizing to the nucleotide sequence of SEQ 
ID NO: 1 or to the complement thereof under such conditions 
that the hybridization is carried out in 6xSSC and 50% 
formamide at 42 C and the washing process is carried out 
in 6xSSC and 40% formamide at 25 , and which encodes a 
thermophilic enzyme having jS -glycosidase activity. These 
conditions are of low stringent. A moderate stringent 
conditions are such that the hybridization is carried out 
in GxSSC and 40% formamide at 42 °C and the washing process 
is carried out in IxSSC and 0% formamide at 5 5 °C . a high 
stringent conditions are such that the hybridization is 
carried out in 6xSSC and 30% formamide at 42 °C and the 
washing process is carried out in 0 . IxSSC and 0% formamide 
at 62 C . The DNA may encode a thermophilic enzyme which 
comprises the amino acid sequence of SEQ ID NO: 2 in which 
of one or a plurality of amino acid residues may be deleted. 



replaced or added and which has 0 -glycosidase activity. 

The present invention further provides a recombinant 
vector containing the DNA therein, a host cell transformed 
with the recombinant vector, and a process for producing 
the enzyme comprising culturing a host cell transformed 
with an expression vector containing a DNA encoding the 
enzyme and then collecting the enzyme from the resultant 
culture. Using this process, the mass production of the 
enzyme becomes possible. 

The present invention further provides a process for 
the hydrolysis of a /? -glycoside having a long alkyl chain 
at the reducing end, with a thermophilic enzyme having 0 
-glycosidase activity which comprises the amino acid 
sequence of SEQ ID NO: 2 in which one or a plurality of 
amino acid residues may be deleted, replaced or added. The 
long alkyl chain may be an alkyl group having carbon atoms 
of 8 or more. The hydrolysis may be carried out at a 

temperature of 85° C or higher, and preferably 100° C or 
higher . 

This specification includes part or all of the contents 
as disclosed in the specification and/or drawings of 
Japanese Patent Application No. 10-222866, which is a 
priority document of the present application and 
incorporated herein by reference in its entirety. 

The above and other objects, effects, features and 
advantages of the present invention will become more 
apparent from the following description of embodiments 
thereof taken in conjunction with the accompanying 
drawings . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the effect of the Triton X-100 
concentration on the His-BGPh Activity. The standard of 
100% was defined as the activity at 0.1% Triton X-100. 

Figure 2 shows thermostability of His-BGPh at 90°C. 
Triton X-100 at 0.1% was present in the reaction mixtures. 
The standard of 100% was defined as the activity without 
heating . 

Figure 3 shows optimum pH of the activity for His- 
BGPh. The OD405 indicates the amount of released p-Nph 
group in acetate buffer (square) and phosphate buffer 
(circle). The closed symbols correspond to the activity 
of BGPh and open symbols correspond to the activity of 
His-BGPh. For these measurements, equal amounts of BGPh 
and His-BGPh were used because the heated suspension I 
(BL21 (DE3 ) /pET- lla/BGPh or BL2 1 ( DEB ) /pET - 1 5b/BGPh) was 
estimated to contain the same amount of each induced protein 
by quantification using SDS-PAGE analysis. 

Figure 4 shows temperature dependency of BGPh. 
Optimum temperature was determined by the plots of 
enzymatic activity (OD405 nm change) against reaction 
temperature. An Arrhenius plot of the data is given in the 
inset . 

Figure 5 shows aligned amino acid sequences of five 
0 - glycosidases from hyperthermophilic archaea. The 
abbreviations of the sources of the enzymes are : BGPh, 
^ - glycosidase from P. horikoshii; BMPh, a 0 - 
mannosidase gene homolog from P. horikoshii (8, 9); BGPf, 
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/3 - glucosidase from P. furiosus (17) ; BMPf , jS - mannosidase 
from P. furiosus (17); S /3 - gly, 0- glycosidase 
f romSul f olobus solfataricus (18) . The conserved residues, 
identified automatically by the GeneWorks program, are 
shown in the open boxes. The reversed open triangles 
indicate the location of the nucleophile (E324) and the 
putative acid/base catalyst (E155 and Hill) with R75 in 
the spatial proximity of the nucleophile of BGPh. The 
arrow shows the prominent deletion of more than 30 residues 
found in BGPh. 

Figure 6 shows illustrated location of the four 
hydrophilic edges on the tetragonal structure of S0 - gly 
(30) and the four hydrophobic areas exposed by removing 
the hydrophilic loops forming the edges. (A) The 
tetragonal arrengement with the hydrophilic edges (blue) . 
(B) The tetragonal arrengement with a hydrophobic surface 
(red) created by the deletion of the hydrophilic loops, 
shielding barrel helices 3 and 4 from solvent. 

Figure 7 shows a comparison of hydropacy profiles 
between BGPh and 30- gly. The panel (A) shows the 
hydropacy profile of BGPh and panel (B) shows the hydropacy 
profile of S0 - gly. The arrows indicate the corresponding 
residue numbers. Two hydrophobic clusters are observed in 
BGPh but not in sP - gly. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention will be described specifically 
be 1 ow . 

The enzyme according to the present invention is a 



thermophilic enzyme having jS - glycosidase activity which 
comprises the amino acid sequence of SEQ ID NO: 2 in which 
one or a plurality of amino acid residues may be deleted, 
replaced or added. The enzyme comprising an amino acid 
sequence of SEQ ID NO: 2 and having -glycosidase activity 
is derived from a su 1 f u r - me t ab o 1 i z ab 1 e thermophilic 
archaeon Pyrococcus horikoshii (the accession number: JCM 
9974), One example of the processes for producing the 
enzyme is described below. 

First, cells of Pyrococcus horikoshii are cultured and 
then chromosomal DNA was prepared therefrom. The 
chromosomal DNA is digested with restriction enzyme (s) to 
give fragments, and a genomic DNA library is constructed 
using the fragments. Clones which cover the chromosome of 
Pyrococcus horikoshii are selected and aligned. The 
aligned clones are sequenced and a gene encoding a & - 
glycosidase is identified. The nucleotide sequence of the 
gene encoding 0 -glycosidase is depicted in SEQ ID NO: 1. 
The gene is amplified by the PGR method and then extracted. 
The extracted gene is inserted into an expression plasmid 
suitable for protein production (e.g., pETlla or pETlSb) . 
The resultant recombinant plasmid is introduced into cells 
of a host (e.g., Escherichia coli), from which the enzyme 
can be produced. The produced enzyme is isolated and 
purified by heating and then subjecting to column 
chromatography . 

As a result, it is revealed that the purified enzyme 
is a protein having a molecular weight of about 45,000 Da 
and capable of hydrolyzing $ -glycosides. When the enzyme 
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is treated in 50 mM phosphate buffer (pH 6.0) containing 
250 mM NaCl at 95 C for 1 hour, its activity is retained 
at the level of 80% based on the initial level. The enzyme 
has an optimum pH of pH 6.0 and an optimum temperature of 
lOO'' C or higher in terms of the enzymatic activity. 

Variants of the enzyme, that is, thermophilic enzymes 
comprising deletion, replacement or addition of one or a 
plurality of amino acid residues in the amino acid sequence 
of SEQ ID NO: 2 and having /3 -glycosidase activity, may be 
prepared by any known techniques, such as s i te - spec i f i c 
mutagenesis and the PGR method. 

The enzymes of the present invention can be used for 
hydrolysis of saccharides, DNA sequencing, conformational 
analysis of glycoproteins and glycolipids, synthesis of 
or igosacchar ides and he terosaccharides with high optical 
purities, and the like. 

DEPOSIT OF MICROORGANISM 

A transformant designated "E, coli BL21 (DE3) 
pET15b/Gly2M" which is E. coli BL21 (DE3) transformed with 
an expression vector containing a 0 -glycosidase gene 
{pET15b/Gly2M) was deposited under the terms of the 
Budapest Treaty on July 27, 1999 at the National Institute 
of Bioscience and Human - t e chno 1 ogy , Agency of Industrial 
Science and Technology, Japan (1-3, Higashi 1-chome, 
Tsukuba-shi, Ibaragi-ken, Japan) under Accession No, PERM 
BP - 6 8 0 0 . 

The following examples are given as more specific 
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illustration of the invention. It should be understood, 
however, that the invention is not limited to the specific 
details set forth in the examples. 

EXAMPLES 

Abbr ev i t i on s : BGPh, -glycosidase from P. horikoshii; 
BMPh, a 0 -mannosidase gene homolog from P. horikoshii; 
BGPf , 0 -glucosidase from P. furiosus; BMPf , j3 -mannosidase 
from P. furiosus; S jS -gly, 0 -glycosidase f romSul f olobus 
solf atar icus ; Amp, ampicillin; IPTG, isopropyl-/3 -D- 
thiogalactopyranoside ; His-BGPh, BGPh with His-tag; 
SDS-PAGE, sodium dodecyl s u 1 f a t e - po 1 y ac ry 1 ami d e gel 
electrophoresis; CBBR, Coomassie Brilliant Blue R; X-Glu, 
5 - bromo - 4- chloro-3-indolyl-iS - glucopyranoside; p-Nph-/3 
-D-Glcp, p - ni trophenyl /3 - D - glucopyranos ide ; LA- jS -D-Glcp, 
$ - D - glucopyranos i de s with long alkyl chains. 

MATKRTAT.fi AND MF.TKOnc; 

Chemicals - The pET-lla vector and ul tr acompe ten t E. 
coli XL2-Blue MRF ' cell were purchased from Stratagene. 
The pET-15b vector and E. coli strain BL21 (DE3) were 
obtained from Novagen. Vent DNA polymerase was purchased 
from New England Biolabs. Restriction enzymes were 
purchased from Promega and Toyobo (Osaka, Japan) , and were 
used according to the manufacturers' recommendations. 
Ultrapure deoxynucl eot ide solution (dNTPs) was obtained 
from Pharmacia Biotech, Isopropyl-;3 - D- 

thiogalac topyranoside (IPTG) was from Takara Shuzo (Otsu, 
Shiga , Japan ) . 
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cloning of Genes and Construction of Expression Vector 
The genome of P. horikoshii was sequenced using the 
method of Kaneko et al . (10) . Standard cloning techniques 
were used throughout. The expression vectors pET-lla and 
pET-15b were doubl e - diges ted by the restriction enzymes 
Nde I and BamH I and the resulting 5.7Kbp fragment was 
purified with a QIAquick Gel Extraction Kit (QIAGEN) . The 

gene coding 0 - glycosidase (BGPh) was amplified by the PGR 
method using the following two primers: upper primer, 
TAAGAAGGAGATATACATATGCCGCTGAAATTCCCGGAAATGTTTCTCTTTGGT 
ACC (SEQ ID NO: 3); lower primer, 

TTTACTGCAGAGAGGATCCCTAATCCTAAAGTTGAAGTTCTGGTAG (SEQ ID 
NO: 4) . The PGR product was cloned into expression vectors 
pET-lla and pET-15b using Ndel and BamHI sites. 

The digested 1.3 Kbp fragment coding BGPh was purified 
and ligated to the insertion sites of the pET-lla and 
pET-15b vectors. Ul tracompe ten t E. coli XL2-Blue MRF ' 
cells were transformed with the recombinant molecule, 
Transf ormants were screened on 2 xYT plates containing 50 
mg/ml of ampicillin (Amp) incubated at Sl^'C overnight. 
The transformant colonies were propagated in 5 ml 2 x YT 
+ Amp medium at 37°C overnight and the vectors pET-lla/BGPh 
and pET-15b/BGPh were purified after cen tr i f uga t ion using 
a Mini Plasmid Kit (QIAGEN). The pET-lla/BGPh and pET- 
15b/BGPh were double - diges ted with Ndel and BamHI and the 
insert length was checked using agarose gel 
electrophoresis. The absence of additional mutations 
within the coding region of BGPh was verified by sequencing 
on an Applied Biosystems 373A DNA sequencer (Taq DyeDeoxy 
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Terminator Cycle Sequencing Kit, PerkinElmer) . 

Overexpre s s ion and Purification of Recombinant Protein 
The E. coli strain BL21 (Dfi3) was transformed with the 
pET-lla/BGPh plasmid to express mature BGPh and pET- 
15b/BGPh plasmid to express His-tagged BGPh. The 
transformant colony was propagated as seed culture in 200 
ml 2 X YT -t-Amp medium at 3 7 °C overnight. An inoculate of 
40 ml seed culture was inoculated to 2 1 of 2 x YT +Amp 
medium. The transformant was induced at OD600 = 1 with 1 mM 
IPTG for 4 h. The induced cells were collected by 
centrif ugation and stored at -20''C . 

The frozen cells (7 g) were thawed and mixed with 10 
ml of 50 mM Tris-HCl buffer (pH 7.5) and 5 . 6 ml of 10% Triton 
X-100/ resulting in a final concentration of 2.5%, The cell 
suspension was heated at S5°C for 10 min, then centrifuged 
at 5000 X g for 20 min . The supernatant was collected and 
stored at 4''C , The cell pellet was mixed with the same 
volume of the buffer and Triton X-100 and heated again. 
The heated sample was centrifuged at 25000 x g for 20 min. 
The combined supernatant was mixed with 1 mg of bovine DNase 
1 (Sigma) and incubated at 37°C for 30 min. The supernatant 
was heated at 85°C for 10 min, then centrifuged at 25000 
X g for 20 min to remove the inactivated DNase. 

The solubilized recombinant BGPh with His-tag 
(Hi s - BGPh) was subjected to affinity chromatography with 
Ni - con j uga ted Sepharose, using a stepwise elution from 5 
mM to 1 M imidazole in 20 mM Tris-HCl (pH 8.0) with 0,5 
M NaCl solution (His-bind Buffer Kit, Novagen) containing 
0.1% Triton X-100. BGPh was eluted with 100 mM imidazole 
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with 0.1% Triton X-100. The enzyme samples were analyzed 

by sodium dodecyl sulfate -polyacrylamide gel 
electrophoresis (SDS-PAGE) (11); a low molecular weight 
electrophoresis calibration kit, purchased from Pharmacia 
Biotech, was also run. For SDS-PAGE (PhastGel, 10-15%), 
the enzyme sample (5 ml) was mixed with SDS sample buffer 
(5 ml) , boiled for 5 min, mixed with marker dye (1 ml) and 
applied to the gel in 1 or 4 ml aliquots. Following 
electrophoresis, protein was detected by Coomassie 
Brilliant Blue R (CBBR) staining according to the 
manufacturer's recommendation. The His-tagged protein 
was detected with QlAexpress Detection System (QIAGEN) 
after blotting onto a nitrocellulose membrane (Pharmacia 
Biotech) . 

Cellular Localization of the Activity - Localization 
of the BGPh activity in E. coli transformant cells 

(BL21 (DE3) /pET-lla/BGPh or BL2 1 (DE3 ) /pET - 15b/BGPh) was 
examined by fractionation of the cell components. The cell 
membrane was isolated as follows: 7 g of the induced cells, 
which were frozen at -20 ''C , were thawed and mixed with 10 
ml of 50 mM Tris-HCl buffer (pH 7.5) . The cell suspension 

(suspension I) was sonicated with a Sonifier 250 (Branson) 
for 4 min at an output control level of 4 and at 30% duty 
cycle. The sonicated sample was centrifuged at 9,000 x g 
for 10 min to remove cell debris, then the supernatant (12 
ml) was ul t racen tr i f uged at 100 , 000 x g for 1 h to separate 
the membrane fraction (1 ml) from the supernatant. The 
enzyme reactions were carried out at 90°C for 15 min in 
a solution (200 ml) containing 1.2 mM 5-bromo-4- 
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chloro - 3 - indolyl - iS - g 1 u c opy r ano s i de (X-Glu) and 5 ml of 
each fraction, as the enzyme source, in 50 mM phosphate 
buffer (pH 6) with 0.3 M NaCl . After the reaction, the 
solution was cooled in ice and diluted with 1 ml of water; 
the absorbance at 62 0 nm was immediately measured. As a 
control, the assay reactions were performed under the same 
conditions but without X-Glu to subtract the turbidity 
derived from each fractionated sample. 

To analyze the solubilizing effect of Triton X-100, 
suspension I was also heated with and without 2.5% Triton 
X-100 at 85°C for 10 min and the supernatant was obtained 
by centr if ugation at 15,000 x g for 10 min. The activity 
of the supernatants was measured using X-Glu as shown above. 

Dependence of the BGPh Activity on Triton X-lOO - The 
enzyme reactions were carried out at 9 B°C for 20 min in 
a solution ( 200 ml) containing 3 mM p-Nph-/8 - D-Glcp (a 
p - ni tr opheny 1 saccharide) and 57.5 pM of the purified 
His-BGPh in 5 0 mM phosphate buf fer (pH6) wi th Tr i t on X - 1 0 0 
and 0.1 M NaCl . The concentration of Triton X-100 in the 
reaction solution was varied from 0.1% to 0.00002%. The 
reaction was terminated by the addition of 1 M Na2C03 (1 
ml), then centrifuged at 15,000 x g for 10 min. The 
concentration of the p-Nph group in the supernatant was 
quantified by measuring the absorbance at 400 nm . 

Measureinent of the Kinetic Parameters - The enzyme 

reactions were carried out at 90 °C in a solution (200 ml) 
containing the substrate and the purified His-BGPh in 50 
mM phosphate buffer (pH 6) with 0,1% Triton X-100 and 0.3 
M NaCl. For the hydrolysis of p - n i t r opheny 1 (p-Nph) 0 
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- D - saccharides , the reaction was terminated by the 
addition of 1 M Na2C03 (1 ml), then centrifuged at 15,000 
X g for 10 min. The concentration of the p-Nph group in 
the supernatant was quantified by measuring the absorbance 
at 400 nm , For the hydrolysis of 0 - D-glucoside, the 
released glucose was analyzed with a Glucose C-II Test kit 
{Wako Pure Chemicals, Japan) . Initial velocities were 
obtained directly from the initial slopes of the time course 
plots. The Km and kcat values were calculated using the 
Mi chae 1 i s - Men t en equation and the least squares method 
(12) , The subsite affinity for a long alkyl chain was 
determined using the method reported previously (13-15) 
on the basis of the subsite theory (16) . 

Optimum Temperature and Optimum pH - The optimum 
temperature was measured as follows: the assay mixture (200 
ml) , which contained 3 mM p - n i t r opheny 1 /3 - D - 
glucopyranos ide (p-Nph-jS - D-Glcp) i n 1 5 0 mM c i t r a t e buf f er 
(pH 5.0) and 1 ml of suspension I (BL21(DE3)/pET-lla/BGPh) , 
was heated at 85C for 10 min. The enzyme reactions were 
carried out in duplicate at temperatures ranging from 50 
to 100 C for 30 min. Optical density measurements at 
A405 were performed as described for the enzyme assays. 

To determine the optimum pH, the assay mixture (200 
ml) , which contained 1 ml of heated suspension I 
(BL21(DE3)/pET-lla/BGPh or BL21(DE3)/pET-15b/BGPh) and 
p-Nph- 0 - D-Glcp (3 mM) in 139 mM buffer systems, was heated 
at 90 °C for 30 min. The pH of the reaction mixtures ranged 
from 3.9 to 5.5 in sodium acetate buffer and from 5.5 to 
7.99 in phosphate buffer. Optical density measurements 
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at A405 were performed as described for the enzyme assays. 

Thermos tabil ity - The His-BGPh solutions (29 nM) in 50 
mM phosphate buffer (pH 6,0) containing 100 mM NaCl and 
0.1% Triton X-100 were heated in sealed Eppendorf tubes 
at 90^C in various increments up to 24 h. The heated 
enzymes were assayed in duplicate in phosphate buffer (pH 
6.0) at 90C for 20 min as described for the determination 
of optimum temperature. 

Sequence Alignment , Phylogenetic Tree, and Hydropacy 
Profile - Sequence alignment of jS - glycosidases was 
performed using the GeneWorks program ( I n t e 1 1 i Gene ti c s , 
Inc.) based on a PAM-250 scoring matrix. The enzymes of 
interest were: 0 - glycosidase (BGPh) studied in this paper 
and iS - mannosidase (BMPh) from P. horikoshii (8, 9) , iS 
- glucosidase (BGPf) and - mannosidase (BMPf) from P. 
furiosus (17) , and /3 - glycosidase {30 - gly) 
f romSul f olobus solfataricus (18) . Phylogenetic trees 
for the same sequences were constructed using the GeneWorks 
program based on the unweighted pair group method with an 
arithmetic mean (19) . Each hydropacy profile was analyzed 
with DNASIS-Mac v2 . 0 software based on the Kyte and 
Doolittle method (20). 

RRflTTT.T.q AND D T H T7 5? T ON 

Localization of the Activity in E. coli Membrane - The 
intracellular localization of His-BGPhwas examined (Table 
I) . 
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Table [. Cellular localization of the activity. The U-ansformaiit ZT. coli BL21 (DE3)/pET15b/BGPh 
cells were used for this experiment. The enzyme reaction were performed at 90^C and pH 6 for 15 
min using X-Glu as substrate, and then A.^n was measured as shown in "MATERIALS AND 
METHODS". 



Activity after each treatment (A^^y) 



Cell fractions Sonication Non-heated Heated Non-healed with Heated with 

2.5% Triton X-IOO 2.5% Triton X-100 



Suspension I 0.585 0.585 0.567 0.485 0.428 

Supernatant at 9,000 xg 0.112 ND ND ND ND 

Supernatant at 15,000 xg ND 0.008 0.005 0.107 0.255 

Supernatant at 100,000 xg 0,010 ND ND ND ND 

Fraction precipitated at 0.478 ND ND ND ND 
100,000 xg 



ND; not determined. 

The induced cells were disrupted by sonication and 
centrifuged to separate the cell components. The membrane 
fraction was precipitated by ul tracentri f ugation at 
100,000 X g from the supernatant recovered by 
centr i f ugation at 9., 0 0 0 x g. The activity was present in 
the membrane fraction whereas no activity was detected in 
the soluble fraction after the ul tracentri f ugation . 
His-BGPhwas solubilized from the cell suspension 
(suspension I) by heating with a detergent, Triton X-100; 
the enzyme was not solubilized by heating without Triton 
X-100. The solubilizing efficiency with Triton X-100 was 
elevated by heating up to 85'C , whereas only 22% of the 
activity was extracted at room temperature. The best 
condition for the solubilization was 2,5% Triton X-100 at 
85'C for 15 min. The native-type BGPhwas also solubilized 
under the same condition as His-BGPh (data not shown); 
however, the denaturation with 8 M urea and the renaturation 
by direct dilution with buffer had no effect on the 
solubilization of the activity (data not shown) . These 
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facts strongly indicate that BGPh is a thermostable 
membrane protein solubilized by Triton X-100. 

His-BGPh was purified by one-step affinity 
chromatography using Ni - con j uga t ed Sepharose. Since the 
recovery of the active enzyme was decreased to a few percent 
by the elimination of Triton X-100 from the chromatographic 
washing and elution buffers, the presence of Triton X- 
100 in the buffer system was essential for the stabilization 
of BGPh. 

As shown in Figure 1, the activity of BGPh was dependent 
on the concentration of Triton X-100. At 0.00002% Triton 
X-100, the activity decreased to 10% of that with 0.1% 
Triton X-100. Furthermore, BGPh was stabilized in the 
presence of 0.1% Triton X-100: the half-life of the activity 
was 15 h at 90^C and pH 6.0 (Fig. 2) . These facts also 
suggest that BGPh is the membrane protein. 

The Substrate Specificity of BGPh - For BGPh both with 
or without His-tag, the optimum pH was 6.0 (Fig. 3) and 
the optimum temperature was over 100 C (Fig. 4) . The 
substrate specificity of His-BGPh was examined using 
p-Nph - j8 - D-saccharides and j3 - D-glucosides as substrates. 
The specificity is summarized in Table II in comparison 
with that of Si3-gly (7, 21). 
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Table 11. Comparison of the kinetic parameters between his-tagged BGPh from P. horikoshii and 
S|3-gly from S. solfataricus strain MT-4 against p-Nph-P-D-saccharides and [3-D-glucosides. 



His-BGP/z (90°C and pH 6.0) Sp-giy" (75'C and pH 6.5) 

Substrates (sec"') (mM) (nxM-'sec'') (sec"') (miVI) (mM-W'') 



Laminanbiose 




1 Q/t 
i 04 


i JO.ZJ 


Ceilobiose 




194 


1698.18 


Ceilotriose 






ND 


Ceilotetraose 




^fD 


ND 


j3-Gentiobiose 




iND 


ND 


p-iN pn- p-D -uicp 




79 




/7-Npa-p-D-Gaif 




12j 


loO 


/7-Npn-p-D-XyI/7 




3 


0.10 


p-Nph- (3-D -Manp 




2 


0.14 


Salicin 




44 • 


i.yo 


Methyl-^-D-Glcp (AlkyI : 




35 


40.74 


n-Amyl-jS-D-Glcp (Alkyl : 


:Q) 


31 


2.02 


n-Hexyl-j3-D-Glc/7 (Alkyl 


:Q) 




0.54 


n-0ctyl-^D-Glcj3 (Alkyl : 


Cs) 


34 


0.20 


n-Nonyl-p-D-Glcp (Alkyl 


:C,) 


39 


0.08 


n-Decyi-p-D-Glcp (Alkyl 


:C^o) 37 


0.08 


n-Undecyl-P-D-Glcp (Alkyl:C„) 43 


0.05 


n-Dodecyl-^D-Glcp (Alkyl:C,2) 36 


0.03 



1 '3 '5 

i.ji 


908 


1 A 
l.U 


yuo.u 


0.11 


1333 


30.0 


44.4 


ND 


197 


3.0 


66 


ND 


584 


1.7 


343 


ND 


1360 


100 


14 


225.67 


542 


0.5 


1084.0 


94.34 


1020 


4.7 


217.0 


31.83 


284 


4.0 


71.0 


14.60 


NH= 


NH 


NH 


22.20 


880 


5.0 


175.9 


0.85 








15.11 


256 


1.1 


232 


60.28 


263 


1.0 


263 


170.70 


313 


0.7 


434 



471.57 
469.62 
944.37 
1152.90 



* Citated from references (7, 21). 

" ND; The parameters were not determined because of too high values. 
" NH; The substrate was not hydrolyzed by S|3-gly. 
-; The parameters were not reported in the references. 
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His-BGPh hydrolyzed aryl glycosides efficiently, 
showing kcat/Km values decreasing in the order p-Nph-^ - 
D-Glcp > p-Nph-iS- D-Galp > p-Nph-/3- D-Xylp > p-Nph-^- 
D-Manp. Beta-linked glucose dimers tested were poorly 
hydrolyzed; the order of preference was /3 1-3 > 0 1-4 > 0 
1-6. The kcat values of BGPh without His-tag for these /3 
- linked glucose dimers approached 400 sec*^, which is 
comparable with those of S /3 - gly (Table II). His-BGPh 
probably had approximately 50% of the activity of BGPh due 
to interference by the His-tag located at the N-terminus, 
(Fig. 3) . Surprisingly, the best substrates for His-BGPh 
were 0 - D - g 1 uco s i de s with long alkyl chains (LA- iS - D-Glcp) . 
The Km values decreased according to the elongation of the 
alkyl chain from to 0^2^ although the kcat value was 
constant (approximately 35 sec'M for each alkyl- /3 - D-Glcp, 
The kcat values of native type BGPh for LA-i3 - D-Glcp 
approached 70 sec"\ calculated on the basis of the value 
of His-BGPh, estimating a 50% decrease in the activity from 
the inhibitory effect of the His-tag. The value was also 
appreciable, around 30% of that of - gly (Table II) , The 
Km value of His-BGPh for the hydrolysis of n-Dodecyl-^ - 
D-Glcp (alkyl chain : C^s) was extremely low, 30 mM at 90C 
and pH 6.0. Of the substrates examined thus far, the best 
substrate was n-Dodecyl-jS - D-Glcp as shown in Table II. 
The kcat/Km value of His-BGPh against n-Dodecyl- jS - D-Glcp 
was 5 times higher than that of p-Nph-^ - D-Glcp and 870 
times higher than that of laminaribiose . Even the value 
for n-0ctyl-j3 - D-Glcp was 0.76 times higher than that of 
p-Nph-jS - D-Glcp and 128 times higher than that of 
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laminar ibiose . The kcat/Km value of 30 - gly against 
n-0ctyl-i3 - D-Glcp, with the longest alkyl chain so far 
examined (21), was 0.4-fold higher than that for p-Nph- 
/3 - D-Glcp and 0.48-foldhigher than that for laminaribiose. 
Laminar ibiose and cellobiose were not good substrates for 
the hydrolysis of His-BGPh because of their Km values higher 
than 100 mM . His-BGPh also hydrolyzed cellotriose and 
cellotetraose with low efficiency: the kinetic parameters 
were not determined because of the extremely high Km value, 
whereas S jS - gly was able to hydrolyze these 
oligosaccharides with high efficiency: the kcat/Km values 
descended in the order; cellotetraose > cellotriose > 
cellobiose. Thus, the substrate specificity of His-BGPh 
is different from those of the other jS - glycos idases , 
including S jS - gly (7, 17, 21-23) . BGPh has a novel 
substrate specificity with high efficiency to hydrolyze 
LA- /3 - D-Glcp and low efficiency to hydrolyze any j3 - linked 
glucose dimer which is more hydrophilic than aryl- or alkyl- 
iS - D-Glcp. The subsite affinity (A{cii}) to bind a long 
alkyl chain ^^id was calculated according to the following 
equation; A (cm ( { kca t / Km) f n-Dodecyi - - d-gicp / (kcat/Km) 
Methyi-i3- D-Glcp) • The affinity was determined to be 4.26 
kcal/mol. The value was reasonable when compared with the 
highest affinity (4,23 kcal/mol) known, that of the 
recognition of one glucose unit in the subsite structure 
of Saccharomycops i s amylase (13, 14) . These facts 
indicate that the hy dr ophob i c i ty of the aglycon part of 
the substrates is strongly recognized by the BGPh molecule 
and the hydrophobic substrates, including aryl- and LA- 
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/3 - D-Glcp, are hydrolyzed effectively with low Km values 
due to hydrophobic interaction between the aglycon moiety 
and the BGPh molecule. Thus, BGPh might be useful to 
synthesize novel /3 - glycosides, including new 
biosurf ac tants , using its transglycosylation activity 
because of its stability in organic solvents (data not 
shown) . 

Henrissat proposed an alternate and complementary 
classification scheme for glycosyl hydrolases based on 
amino acid sequence similarities (24 - 26) . For example, 
glycosyl hydrolase family 1 is composed of exo-acting, 0 
' specific enzymes with similar amino acid sequences. The 
five jS - glycosidases , including BGPh from the archaea 
domain (as shown in Fig. 5), belong to family 1. Some 
family 1 glycosyl hydrolases also have glycosyl 
transferase activities. The S. solfataricus ^- 
glucosidase has been implicated in the glycosy la t ion of 
membrane lipid components (27) . Similarly, the enzymatic 
analysis of BMPf predicted its possible role in the 
synthesize of intracellular components including protein, 
membrane components, or other compounds (17) . Since the 
localization of BGPh on E. coli membrane strongly indicates 
the intimate interaction of the enzyme and lipid components, 
the detection of BGPh on the Pyrococcus cell surface using 
antibody against the enzyme must be done to clarify its 
true function in the Pyrococcus cell. 

The Structural Elements Responsible for Membrane 
Local! zat ion and the Conservation of Res idues Forming the 
Active Site - The sequence alignment among BGPh and four 
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different /3 - glycosidas es , whose biochemical 
characteristics have been reported (7, 17, 21-23) , is shown 
in Figure 5. According to the phylogenetic analysis based 
on the alignment, the tree has three branches : one 
corresponding to a j8 - glycosidase group that includes BGPf 
and S j8 - gly; another containing BMPh and BMPf , which were 
close to jS - mannosidase. BGPh belongs to the third branch, 
located some distance from the first two branches. The 
polypeptide length of BGPh is also approximately 13% 
shorter than those of the other four 0 - glycosidases and 
might be one of the shortest sequences so far reported (8, 
17, 18, 28) , As shown in Figure 5, the residues E155 and 
Hill of BGPh correspond to E206 and H150 as the putative 
acid/base catalyst in the 30 - gly molecule (28, 29) , whose 
steric structure has been reported (30). The residues 
E324 and R75 of BGPh correspond to E387, the nucleophile, 
and R79 in the spatial proximity of the nucleophile (28, 
29) , The complex structure of Bacillus polymixa 0 • 
glycosidase with the inhibitor gluconate has been reported 
(31), The BGPh residues, Q19, Hill, N154, E155, Y267, 
E324, W362, E369, and W370 are completely conserved (Fig. 
5) and correspond to the B. polymixa jS - glycosidase 
residues, Q20, H121, N165, E166, Y296, E352, W398, E405, 
and W406, which form the intimate interaction with the 
inhibi tor (31). 

To understand the localization mechanism of BGPh to 
the membrane, a major structural difference between BGPh 
and the other soluble 0 - glycosidases was analyzed using 
the sequence alignment and the steric structure of S0 - 
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gly (30) . The SjS - glymolecule has the classic ( 0 oc) q barrel 
fold first seen in the structure of triose phosphate 
isomerase (32) , For BGPh, the prominent deletion of more 
than 30 residues was found after the 78th residue, as 
indicated in Figure 5. The deletion region of BGPh 
corresponds to loops from the 89th to 125th residues of 
S j8 - gly, mainly shielding the helices 3 and 4 from solvent. 
The hydrophilic loops, which pack against the outer face 
of the barrel helices 3 and 4, were not present in the BGPh 
molecule, A tetrameric 80- gly structure has been 
reported, in which these loop regions were located at the 
four edges of regular tetragonal molecular arrangement 
(30). Figure 6 illustrates the location of the four 
hydrophilic edges and four hydrophobic areas which appear 
following the removal of the hydrophilic loops. Since BGPh 
as well as 30 - gly was proved to be tetramer by gel 
filtration using buffer containing 0.01% Triton X-100 
(data not shown) , the deletion of these hydrophilic loops 
probably results in the exposure of helices 3 and 4 to the 
solvent at the four edges of the tetrameric structure. The 
exposed hydrophobic areas might interact with lipid 
components to embed the molecule in the membrane. 

The increased hy dr ophob i c i ty at barrel helices 3 and 
4 is also indicated by the comparison of the hydropacy plots 
of BGPh and 30 - gly, as shown in Figure 7. Two major 
hydrophobic clusters were observed in the region of BGPh 
between residues 79 and 210 corresponding to the region 
of 30' gly between residues 90 and 265. These residues 
form tertiary structures from the end of 0 - sheet 2 to the 
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beginning of ^- sheet 5 of the ( 0 a) ^ barrel fold (30). 
The first cluster was located between residues 79 to 114, 
forming a helix with a loop shortened by the deletion, 
a-helix 2, and jS - sheet 3. The second cluster was present 
between residues 131 and 210, corresponding the barrel 
fold between a-helices 3 and 4 exposed to solvent, A 
hydrophilic module that might be important for enzyme 
orientation on the membrane was found between residues 114 
to 131, corresponding to the hydrophilic helices at the 
molecule surface located between the /3 - strand and the 
a-helix in the third repeat of the barrel fold. The two 
hydrophobic clusters, but not the hydrophilic module, were 
lacking in the corresponding region of S0 - gly (18) . 

A mechanism for the localization of BGPh is proposed 
here based on the possible hydrophobic interaction between 
the membrane and the exposed hydrophobic helices 3 and 4 
at the four edges of the tetrameric structure exposed by 
the deletion of the hydrophilic loops. Furthermore, the 
mechanism is well supported by the hydropacy profile of 
BGPh, in which the hydrophobic cluster is formed by the 
barrel fold between a-helices 3 and 4. The exposed 
hydrophobic areas may lead the hydrophobic substrates to 
the active site and bind them there. However, further 
studies using the crys tal lographi c analysis are needed for 
a more definitive description of the detailed mechanism 
for recognition of the hydrophobic aglycon part, including 
a long alkyl - chain . 

As described above, the present invention provides a 
novel i3 - glycosidase . The i3 -glycosidase is stable under 
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extreme conditions. Therefore, the P -glycosidase can be 
used to develop he terosaccharides with high optical 
pur i ties. 
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cited herein are incorporated herein by reference in their 
en t i ty . 

The invention has been described in detail with 
reference to various embodiments, and it will now be 
apparent from the foregoing to those skilled in the art 
that changes and modifications may be made without 
departing from the invention in its broader aspects, and 
it is the invention, therefore, in the appended claims to 
cover all such changes and modifications as fall within 
the true spirit of the invention. 

The following is information on sequences described 
herein : 

SEQUENCE LISTING 

<110> Director - General of Agency of Industrial Science and Technology 

<120> Heat-resistant enzyme having & - glycosidase activity 

<130> PH-679US 

<150> JP 10 - 222866 
<141> 1998-08-06 

< 1 6 0 > 4 

<170> PatentIn Ver . 2.0 

< 2 1 0 > 1 
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< 2 1 1 > 

< 2 1 2 > 

< 2 1 3 > 



126 9 
DNA 

Pyrococcus horikoshii 



<220 > 

< 2 2 1 > CDS 

<222 > ( 1 ) . - (12 69) 



< 4 0 0 > 1 

atg ccg ctg aaa ttc ccg gaa atg ttt etc ttt ggt acc gca aca tea 48 

Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 

15 10 15 

tec cat cag ata gag gga aat aat aga tgg aat gat tgg tgg tac tat 96 

Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

20 25 30 

gag cag att gga aag etc cec tac aga tct ggt aag get tgc aat cac 144 

Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

3 5 4 0 4 5 

tgg gaa ctt tac agg gat gat att cag eta atg acc age ttg ggc tat 192 

Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

50 55 60 

aat get tat agg ttc tec ata gag tgg age agg eta ttc eca gag gaa 240 

Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

65 70 75 80 

aat aaa ttt aat gaa gat get ttc atg aaa tac egg gag att ata gac 288 
Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

85 9 0 95 

ttg tta ttg acg aga ggt ata act cec ctg gtg acc eta cac cac ttt 336 
Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 
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100 105 110 

act age cct etc tgg ttc atg aag aaa ggt ggc ttc ctt agg gag gag 384 
Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

115 120 125 

aac eta aaa cat tgg gaa aag tac ata gaa aag gtt get gag ctt tta 432 
Asn Leu Lys His Trp Glu Lys Tyr lie Glu Lys Val Ala Glu Leu Leu 

130 135 140 

gaa aaa gtt aaa eta gta get acc ttc aat gag ccg atg gta tac gta 480 
Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 
145 150 155 160 

atg atg gga tat eta acg get tat tgg ece cca ttc att agg agt cca 528 
Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

165 170 175 

ttt aag gee ttt aag gta get gea aac ctg ctt aaa get cac gca att 576 
Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

180 185 190 

gcc tat gaa ctt ctt cat ggg aaa ttc aaa gtt gga ate gta aag aat 624 
Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

195 200 205 

att cec ata ata etc cca gcg agt gac aag gag agg gat aga aaa gcc 672 
lie Pro lie lie Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

210 215 220 

get gag aaa get gat aat tta ttt aac tgg cac ttt ttg gat gcg ata 720 
Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala lie 
225 230 235 240 

tgg agt ggg aaa tac aga ggg gta ttt aaa aca tat agg att ece caa 768 
Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg lie Pro Gin 

245 250 255 

agt gac gca gat ttc att ggg gtt aac tat tac acg gcc age gaa gta 816 
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Ser Asp Ala Asp Phe lie Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

260 265 270 

agg cat act tgg aat cct tta aaa ttc ttc ttt gag gtg aaa tta gcg 864 
Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

275 280 285 

gat att age gag agg aag act caa atg gga tgg age gtt tat cca aaa 912 
Asp lie Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

290 295 300 

gga ata tac atg gcc ctt aaa aaa get tec agg tat gga agg cct ctt 960 
Gly lie Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 
305 310 315 320 

tat att acg gaa aac gga ata gcg acg ctt gat gat gaa tgg aga gtg 1008 
Tyr lie Thr Glu Asn Gly lie Ala Thr Leu Asp Asp Glu Trp Arg Val 

325 330 335 

gaa ttc ata att caa cac etc caa tac gtt cat aag get ate gaa gac 1056 
Glu Phe lie lie Gin His Leu Gin Tyr Val His Lys Ala lie Glu Asp 

340 345 350 

ggc ctg gat gta aga ggt tac ttc tat tgg tea ttt atg gat aac tac 1104 
Gly Leu Asp Val a rg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

355 360 365 

gag tgg aaa gag ggg ttt ggg eet aga ttt ggc eta gtg gaa gtt gat 1152 
Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

370 375 380 

tat caa acc ttc gag aga agg ccc agg aag agt get tac gta tac gga 1200 
Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 
385 390 395 400 

gaa att gca aga agt aag gaa ata aag gat gag eta tta aag aga tat 1248 
Glu lie Ala Arg Ser Lys Glu lie Lys Asp Glu Leu Leu Lys Arg Tyr 
405 410 415 
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ggc eta cca gaa ctt caa ctt 1269 
Gly Leu Pro Glu Leu Gin Leu 
420 



<210 > 2 

<211> 423 

< 2 1 2 > PRT 

<213> Pyrococcus horikoshii 



< 4 0 0 > 2 

Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 

15 10 15 

Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

20 .25 3 0 

Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

35 40 45 

Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

50 55 60 

Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

65 70 75 80 

Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

85 9 0 9 5 

Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 

100 105 110 

Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

115 120 125 

Asn Leu Lys His Trp Glu Lys Tyr lie Glu Lys Val Ala Glu Leu Leu 
130 135 140 
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Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

145 150 155 160 

Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

165 170 175 

Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

18 0 185 190 

Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

195 200 205 

lie Pro lie lie Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

210 215 220 

Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala lie 
225 230 235 240 

Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg lie Pro Gin 

245 250 255 

Ser Asp Ala Asp Phe lie Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

260 265 270 

Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

275 280 285 

Asp lie Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

290 295 300 

Gly lie Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 
305 310 3 15 3 20 

Tyr lie Thr Glu Asn Gly lie Ala Thr Leu Asp Asp Glu Trp Arg Val 

325 330 335 

Glu Phe lie lie Gin His Leu Gin Tyr Val His Lys Ala He Glu Asp 

340 345 350 

Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

355 360 365 

Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 
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370 375 380 

Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 
385 390 395 400 

Glu lie Ala Arg Ser Lys Glu lie Lys Asp Glu Leu Leu Lys Arg Tyr 

405 410 415 

Gly Leu Pro Glu Leu Gin Leu 
42 0 



<210 > 3 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<22 0 > 

<223> Description of Artificial Sequence:An upper primer designed to 
create 

the Nde I site. 
<400> 3 

taagaaggag atatacatat gccgctgaaa ttcccggaaa tgtttctctt tggtacc 57 



< 2 1 0 > 4 
<211> 46 
<212> DNA 

<213> Artificial Sequence 

< 2 2 0 > 
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<223> Description of Artificial Sequence:A lower primer designed to 
create the BamHI site. 

< 4 0 0 > 4 

tttactgcag agaggatccc taatcctaaa gttgaagttc tggtag 46 
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WHAT IS CLAIMED IS: 

1/ A thermophilic enzyme having 0 -glycosidase activity 
ywhich comprises the amino acid sequence of SEQ ID NO: 2 
in which one or a plurality of amino acid residues may be 
deleted, replaced or added. 

2. The enzyme of claim 1, having an optimum temperature 
of 100^ C or higher. 

A DNA which is capable of hybridizing to the nucleotide 
sequence of SEQ ID NO: 1 or to the complement thereof under 
such conditions that the hybridization is carried out in 
6xSSC and 50% formamide at 42 ''C and the washing process 
is carried out in 6xSSC and 40% formamide at 25 , and which 
encodes a thermophilic enzyme having 0 -glycosidase 
ac tivi ty . 

4. The DNA of claim 3, which encodes the enzyme of claim 
1 . 

5. A recombinant vector containing the DNA of claim 3 
therein . 

6. A host cell transformed with the recombinant vector 
of claim 5 . 

7. A process for producing the enzyme of claim 1, 
comprising culturing a host cell transformed with an 
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expression vector containing a DNA encoding the enzyme and 
then collecting the enzyme from the resultant culture. 



a long alkyl chain at the reducing end, with a thermophilic 
enzyme having jS -glycosidase activity which comprises the 
amino acid sequence of SEQ ID NO: 2 in which one or a 
plurality of amino acid residues may be deleted, replaced 
or added . 

9. The process of claim 8, wherein the long alkyl chain 
is an alkyl group having carbon atoms of 8 or more. 

10. The process of claim 8, wherein the hydrolysis is 

o 

carried out at a temperature of 85 C or higher. 

11. The process of claim 8, wherein the hydrolysis is 
carried out at a temperature of lOO"* C or higher. 




for the hydrolysis 



of a jS -glycoside having 
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ABSTRUCT OF THE DISCLOSURE 

The invention relates to a thermophilic enzyme having 
/3 -glycosidase activity which comprises the amino acid 
sequence of SEQ ID NO: 2 in which one or a plurality of 
amino acid residues may be deleted, replaced or added. 
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SEQUENCE LISTING 



<110> Director - General of Agency of Industrial Science and Technology 

<120> Heat-resistant enzyme having $ - glycosidase activity 

<130> PH-679US 

<150> JP 10-222866 
<141> 1998-08-06 

< 160 > 4 

<170> PatentIn Ver. 2.0 

< 2 10> 1 
<211> 1269 

< 2 1 2 > DNA 

<213> Pyrococcus horikoshii 

< 220> 

< 22 1 > CDS 

< 222 > (1) . , ( 1269) 

<400 > 1 

atg ccg ctg aaa ttc ccg gaa 

Met Pro Leu Lys Phe Pro Glu 

1 5 

tec cat cag ata gag gga aat 



atg ttt etc ttt ggt acc gca aca tea 48 

Met Phe Leu Phe Gly Thr Ala Thr Ser 

10 15 

aat aga tgg aat gat tgg tgg tac tat 96 
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Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

2 0 2 5 3 0 

gag cag att gga aag etc ccc tac aga tct ggt aag get tgc aat cac 144 

Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

3 5 4 0 4 5 

tgg gaa ctt tac agg gat gat att cag eta atg acc age ttg ggc tat 192 

Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

5 0 5 5 6 0 

aat get tat agg ttc tec ata gag tgg age agg eta ttc cca gag gaa 240 

Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

65 70 75 80 

aat aaa ttt aat gaa gat get ttc atg aaa tac egg gag att ata gac 288 

Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

85 90 95 

ttg tta ttg acg aga ggt ata act ccc ctg gtg ace eta cac cac ttt 336 

Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 

100 105 110 

act age ect etc tgg ttc atg aag aaa ggt ggc ttc ctt agg gag gag 384 

Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

115 120 125 

aac eta aaa cat tgg gaa aag tac ata gaa aag gtt get gag ctt tta 432 

Asn Leu Lys His Trp Glu Lys Tyr lie Glu Lys Val Ala Glu Leu Leu 

130 135 140 

gaa aaa gtt aaa eta gta get acc ttc aat gag ecg atg gta tac gta 480 

Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

145 150 155 160 

atg atg gga tat eta acg get tat tgg eec cca ttc att agg agt cca 528 

Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

165 170 175 



ttt aag gcc ttt aag gta get gca aac ctg ctt aaa get cac gca att 576 

Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala He 

180 185 190 

gcc tat gaa ctt ctt cat ggg aaa ttc aaa gtt gga ate gta aag aat 624 

Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

195 200 205 

att ccc ata ata etc cca gcg agt gae aag gag agg gat aga aaa gcc 672 

He Pro He He Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

210 215 220 

get gag aaa get gat aat tta ttt aac tgg cac ttt ttg gat gcg ata 720 

Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala He 

225 230 235 240 

tgg agt ggg aaa tac aga ggg gta ttt aaa aca tat agg att ccc caa 768 

Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg He Pro Gin 

245 250 255 

agt gae gca gat ttc att ggg gtt aac tat tac acg gcc age gaa gta 816 

Ser Asp Ala Asp Phe He Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

260 2 65 27 0 

agg cat act tgg aat cct tta aaa ttc ttc ttt gag gtg aaa tta gcg 864 

Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

27 5 280 285 

gat att age gag agg aag act caa atg gga tgg age gtt tat cca aaa 912 

Asp He Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

290 295 300 

gga ata tac atg gcc ctt aaa aaa get tec agg tat gga agg cct ctt 960 

Gly He Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 

305 310 315 320 

tat att acg gaa aac gga ata gcg acg ctt gat gat gaa tgg aga gtg 1008 

Tyr He Thr Glu Asn Gly He Ala Thr Leu Asp Asp Glu Trp Arg Val 



325 330 335 

gaa ttc ata att caa cac etc caa tac gtt cat aag get ate gaa gac 1056 
Glu Phe lie lie Gin His Leu Gin Tyr Val His Lys Ala lie Glu Asp 

340 345 350 

ggc ctg gat gta aga ggt tac ttc tat tgg tea ttt atg gat aac tac 1104 
Gly Leu Asp Val a rg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

355 360 365 

gag tgg aaa gag ggg ttt ggg eet aga ttt ggc eta gtg gaa gtt gat 1152 
Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

370 375 380 

tat caa acc ttc gag aga agg cec agg aag agt get tac gta tac gga 1200 
Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 
385 390 395 400 

gaa att gca aga agt aag gaa ata aag gat gag eta tta aag aga tat 1248 
Glu lie Ala Arg Ser Lys Glu lie Lys Asp Glu Leu Leu Lys Arg Tyr 

405 410 415 

ggc eta cca gaa ctt caa ctt 1269 
Gly Leu Pro Glu Leu Gin Leu 
42 0 



<210 > 2 
<211 > 423 
<212> PRT 

<213> Pyrococeus horikoshii 
< 4 0 0 > 2 

Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 
15 10 15 
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Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

20 25 3 0 

Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

3 5 4 0 4 5 

Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

50 55 60 

Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

65 70 75 80 

Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

85 90 95 

Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 

100 105 110 

Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

115 120 125 

Asn Leu Lys His Trp Glu Lys Tyr lie Glu Lys Val Ala Glu Leu Leu 

130 135 140 

Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 
145 150 155 160 

Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

165 170 175 

Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

18 0 1 8 5 19 0 

Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

195 200 205 

lie Pro lie lie Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

210 215 220 

Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala lie 
225 23 0 235 24 0 

Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg lie Pro Gin 
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245 250 255 

Ser Asp Ala Asp Phe He Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

260 265 270 

Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

275 280 285 

Asp He Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

290 295 300 

Gly He Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 
305 310 315 320 

Tyr He Thr Glu Asn Gly He Ala Thr Leu Asp Asp Glu Trp Arg Val 

325 330 335 

Glu Phe He He Gin His Leu Gin Tyr Val His Lys Ala He Glu Asp 

340 345 350 

Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

355 .360 365 

Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

370 375 380 

Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 
385 390 395 400 

Glu He Ala Arg Ser Lys Glu He Lys Asp Glu Leu Leu Lys Arg Tyr 

405 410 415 

Gly Leu Pro Glu Leu Gin Leu 
420 



< 210 > 3 

< 211 > 57 
<212> DNA 

<213> Artificial Sequence 
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< 22 0 > 

<223> Description of Artificial Sequence:An upper primer designed to 
create 

the Ndel site. 

< 4 0 0 > 3 

taagaaggag atatacatat gccgctgaaa ttcccggaaa tgtttctctt tggtacc 57 



<210> 4 

< 211 > 46 
<212> DNA 

<213> Artificial Sequence 

< 22 0 > 

<223> Description of Artificial SequencetA lower primer designed to 
create the BamHI site. 

< 4 0 0 > 4 

tttactgcag agaggatccc taatcctaaa gttgaagttc tggtag 46 
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